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A  MONTE  CARLO  INVESTIGATION  OF  ECONOMETRIC  MODELS  WITH  FIXED  AND 

STOCHASTIC  REGRESSORS  WHERE  THE  ERROR  TERMS  ARE  AR(l)  AND  MA(i) 

~  A  Monte  Carlo  study  of  four  specialized  regression  procedures 
is  reported.  Two  of  the  procedures  are  designed  for  a  simple  linear 
econometric  model  while  the  other  two  procedures  are  designed  for  a 
single  period  lagged  endogenous  variable  and  single  exogenous 
variable  econometric  model.  For  the  simple  linear  model,  the  small 
sample  properties  of  the  Pesaran  procedure,  designed  for  an  MA(l) 
error  term,  and  the  Beach-MadCinnon  procedure,  designed  for  an  AR(l) 
error  term,  are  compared  against  the  small  sample  properties  of  CSLS 
and  against  those  of  the  Prals-Winston  procedure .  For  the  lagged 
endogenous  model,  the  small  sample  properties  of  the  Zellner-Geisel 
procedure,  designed  for  an  MA(l)  error  term,  and  the  Wallis  procedure, 
designed  for  an  AR(l)  error  term,  are  likewise  compared  against  the 
small  sample  properties  of  OLS  and  of  Prais-Winston.  In  addition,  the 
power  of  the  Durbin-Watson  d  test  is  analyzed  for  both  models  and  the 
Durbin  h  and  the  McNown  tests  are  analyzed  for  the  lagged  endogenous 
model.  For  the  simple  linear  model,  the  Prais-Winston  transformation 
is  only  performed  if  the  Durbin-Watson  d  test  indicates  the  hypothesis 
of  no  autocorrelation  should  be  rejected.  For  the  lagged  endogenous 
model,  the  transformation  is  only  performed  if  the  McNown  test 
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indicates  the  hypothesis  should  be  rejected.  Finally,  to  analyze  the 
effects  of  misspecification,  the  two  specialized  procedures  for  each 
model  are  applied  to  the  wrong  error  structure. 
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Anselmi,  Michael  Stephen  (Ph.D.,  Economics) 

A  Monte  Carlo  Investigation  of  Econometric  Models  with  Fixed  and 
Stochastic  Regressors  Where  the  Error  Terms  are  AR(l)  and  MA(l) 
Thesis  directed  by  Professor  J.  Malcolm  Dowling 

A  Monte  Carlo  study  of  four  specialized  regression  procedures 
is  reported.  Two  of  the  procedures  are  designed  for  a  simple  linear 
econometric  model  while  the  other  two  procedures  are  designed  for  a 
single  period  lagged  endogenous  variable  and  single  exogenous 
variable  econometric  model.  For  the  simple  linear  model,  the  small 
sample  properties  of  the  Pesaran  procedure,  designed  for  an  MA(l) 
error  term,  and  the  Beach-MacKinnon  procedure,  designed  for  an  AR(l) 
error  term,  are  compared  against  the  small  sample  properties  of  OLS 
and  against  those  of  the  Prais-Winston  procedure .  For  the  lagged 
endogenous  model,  the  small  sample  properties  of  the  Zellner-Geisel 
procedure,  designed  for  an  MA(l)  error  term,  and  the  Wallis  procedure, 
designed  for  an  AR(l)  error  term,  are  likewise  compared  against  the 
small  sample  properties  of  OLS  and  of  Prais-Winston.  In  addition,  the 
power  of  the  Durbin-Watson  d  test  is  analyzed  for  both  models  and  the 
Durbin  h  and  the  McNown  tests  are  analyzed  for  the  lagged  endogenous 
model.  For  the  simple  linear  model,  the  Prais-Winston  transformation 
is  only  performed  if  the  Durbin-Watson  d  test  indicates  the  hypothesis 
of  no  autocorrelation  should  be  rejected.  For  the  lagged  endogenous 
model,  the  transformation  is  only  performed  if  the  McNown  test 
indicates  the  hypothesis  should  be  rejected.  Finally,  to  analyze  the 
effects  of  misspecification,  the  two  specialized  procedures  for  each 
model  are  applied  to  the  wrong  error  structure .  The  Pesaran 
procedure  was  superior  to  CLS  and  Prais-Winston  in  the  estimation  of 
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the  coefficient  of  the  exogenous  variable  but  inferior  in  the 
estimation  of  the  coefficient  of  autocorrelation.  It  did  remove  the 
autocorrelation  under  both  error  structures.  The  Beach-MacKinnon 
procedure  demonstrated  results  similar  to  those  of  Fesaran.  For  both 
MA(l)  and  AR(l)  autocorrelation,  the  Zellner-Geisel  procedure  was  far 
superior  to  Prais-Winston  and  better  than  CLS  for  large  values  of  the 
coefficient  of  autocorrelation.  It  did  appear  to  remove  M(l)  auto¬ 
correlation  from  the  residuals  but  was  not  very  successful  in  removing 
AR(l)  autocorrelation.  The  Wallis  procedure  was  inferior  to  QLS  for 
both  error  structures.  In  certain  circumstances,  the  Wallis  procedure 
provided  better  estimates  than  Prais-Winston.  In  terms  of  removing 
autocorrelation,  the  Wallis  procedure  did  not  do  an  acceptable  job  for 
either  error  structure .  All  three  tests  for  autocorrelation  proved  to 
be  nearly  equal  in  power  when  the  error  term  was  AR(l).  When  the 
error  structure  was  MA(l),  the  Durbin-Watson  d  test  exhibited 
remarkable  power.  For  the  lagged  endogenous  model,  the  a  test 
outperformed  the  Durbin  h  test  which  outperformed  the  McNown  test 
when  the  error  term  was  MA(l). 

This  abstract  is  approved  as  to  form  and  content.  I  recommend  its 
publication. 
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CHAPTER  I 


INTRODUCTION 

In  numerous  recent  investigations  in  the  literature,  error 
structures  in  the  linear  regression  model  other  than  the  first-order 
autoregressive  or  Markov  model  have  appeared.  In  particular,  a 
moving  average  (MA)  error  structure  has  been  applied  in  many  different 
areas:  Trivedi  (1970; 1973)  and  Burrows  and  Godfrey  (1973)  have  used 
an  MA  error  structure  in  their  analysis  of  inventory;  in  consumption 
dos  Santos  (1972)  and  Zellner  and  Geisel  (1970)  applied  an  MA  error 
structure;  Hess  (1973)  applied  it  to  durable  goods  demand;  Rowley 
and  Wilton  (1973)  to  wage  determination;  and  Pesaran  ( 1 973 )  to 
investment . 

Despite  this,  many  textbooks  and  applied  researchers  continue  to 
place  emphasis  on  an  autoregressive  (AR )  error  structure  as  the  only 
alternative  to  the  usual  assumption  of  residual  independence .  Of  the 
many  available  econometrics  textbooks,  only  a  few  devote  any  space  to 
a  discussion  of  autoregressive-moving  average  (ARM)  error  structures^ 
and  there  is  almost  no  mention  of  how  these  structures  may  arise  in 
applied  econometric  investigations.  For  example,  in  discussing 
the  familiar  Koyck  model  (195^),  several  textbooks  derive  a  reduced 

^See  for  example,  Pindyck  and  Rubinfeld  (1976). 


L 


2 


form  equation  similar  to  the  following: 

-  XYt-i  ♦  ea-\)xt  *  \  (l) 

where 

ut  =  vt'xVr  (2) 

If  v  is  serially  independent,  u  is  autocorrelated  and  follows  a 

X  X 

first-order  moving  average  (MA(l))  structure.  Nevertheless,  the 
autocorrelation  function  of  the  MA(l)  structure  is  seldom  derived 
or  estimation  methods  explored. 

There  are,  of  course,  good  reasons  for  the  popularity  of  the 
AR(l)  error  structure.  The  Durbin-Watson  d  test  (1950; 1951)  and 
Cochrane -Orcutt  (1949)  procedures  have  been  known  for  many  years  and 
are  built  into  most  canned  regression  packages  while  testing  and 
correction  procedures  for  the  more  general  class  of  ARMA  models 
have  only  recently  been  developed.  Moreover,  efficient  estimation 
of  these  alternative  error  structure  models  typically  requires  the 
use  of  maximum  likelihood  or  nonlinear  least  squares  computer  routines 
sufficiently  complicated  to  disuade  researchers  who  find  it  so  easy  to 
use  the  canned  routines. 

In  addition  to  the  limited  discussion  and  use  of  alternative 
error  structures,  Nichols,  Pagan  and  Terrell  in  a  recent  survey 
article  (1975)  report  a  ".  .  .  paucity  of  small  sample  studies.  .  .  ." 
Most  of  the  Monte  Carlo  studies  they  found  were  in  terms  of  a  pure 
MA  time  series  r  -1  with  no  independent  variable.  A  conspicuous 
exception  to  this  would  be  the  work  of  Hendry  and  Trivedi  (1972). 

In  an  attempt  to  learn  more  about  the  small  sample  properties  of 
models  -with  an  MA(i)  error  structure,  I  have  analyzed  via  Monte  Carlo 


techniques  two  simple  econometric  models  using  both  AR(l)  and  MA(l) 
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error  structures. 


Model  I 

Model  I  is  a  simple  linear,  two  variable  econometric  model, 

L  =  »t  +  V  O) 

If  u  %  NID ( 0 , <j ^ ) ,  the  ordinary  least  squares  (OLS)  estimate  of  g  is 
unbiased,  consistent  and  efficient.  If  u  is  autocorrelated,  the  OLS 

it 

estimate  of  g  is  still  unbiased  but  asymptotically  inefficient. 

Model  I  with  an  AR(l)  Error  Structure 

If  ut  in  (3)  follows  a  first-order  autoregressive  structure, 

Ut  =  pUt-l  +  vt 

where  -1  <  p  <  1  and  v  -v  NID(0,c?2),  then  a  maximum  likelihood  (ML) 

X>  V 

procedure  developed  by  Beach  and  MacKinnon  (1978)  will  provide 

2 

estimates  for  j3  and  p  that  are  asymptotically  efficient. 

I  compared  the  small  sample  results  of  the  Beach  and  MacKinnon 
procedure  to  OLS;  the  Beach-MacKinnon  procedure  is  described  in  detail 
in  Chapter  II  and  the  results  are  presented  in  Chapter  IV . 

I  will  henceforth  refer  to  this  model  cum  error  structure  as 
Model  I-AR,  that  is  Model  I  with  an  AR(l)  error  structure.  I  will 
also  refer  to  the  3each-MacKinnon  procedure  as  the  3-M  procedure . 

Hildreth  and  Bent  (1974)  also  have  a  procedure  that  provides 
efficient  estimates  but  the  Beach-MacKinnon  procedure  was  more  recent 
and  it  appears  to  use  fewer  computer  resources. 


Model  I  with  an  MA(l)  Error  Structure 

If  in  (3)  follows  a  first-order  moving  average  structure, 


ut  =  Vt  '  Xvt-1 


where  -1  <  \  <  1  and  v  ^  NID(0,<jy),  then  an  ML  procedure  developed 
by  Pesaran  (1973)  will  provide  estimates  for  3  and  A.  that  are 
asymptotically  efficient. 

I  likewise  compared  the  small  sample  results  of  the  Pesaran 
procedure  to  OLS;  the  Pesaran  procedure  is  described  in  detail  in 
Chapter  II  and  the  results  of  my  comparison  are  presented  in  Chapter 


Henceforth,  I  will  refer  to  this  model  as  Model  I-MA  and  to 
the  Pesaran  procedure  as  IES. 


Model  II 


Model  II  is  a  single-period  lagged  endogenous  variable  and 
single  exogenous  variable  econometric  model, 


h  =  @lYt-i  +  ^t  +  V 


If  u  <\>  NID(0,c2),  CLS  will  provide  consistent  estimates  for  p1  and 
3S.  If,  however,  Uj.  is  autocorrelated,  OLS  will  provide  biased, 
inconsistent  and  inefficient  estimates.  Additionally ,  Griliches 
(l96l)  and  others  have  shown  that  as  long  as  is  positive,  CLS 
will  contain  an  upward  bias  in  the  estimate  cf  3i> 
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Model  II  with  an  AR(l)  Error  Structure 

If  u  in  (6)  follows  the  AR(l)  error  structure  given  by  (4), 
then  a  generalized  least  squares  (GLS)  procedure  developed  by 
Wallis  (1967)  provides  consistent  but  not  fully  efficient  estimates  of 
and  (32 . 

I  compared  the  small  sample  results  of  the  Wallis  procedure  to 
OLSj  the  Wallis  procedure  is  described  in  detail  in  Chapter  III  and 
the  results  are  presented  in  Chapter  IV . 

Henceforth,  this  model  will  be  referred  to  as  Model  II -AR  and 
the  Wallis  procedure  will  be  referred  to  as  the  WAL  procedure. 

Model  II  with  an  MA(l)  Error  Structure 

If  u^  in  (6)  follows  the  MA(l)  error  structure  given  by  (5), 
then  an  ML  procedure  developed  by  Zellner  and  Geisel  (1970)  provides 
consistent  and  efficient  estimates  of  3i  and  3s • 

I  compared  the  small  sample  results  of  the  Zellr.er-Geisel 
procedure  to  OLSj  like  the  WAL  procedure,  the  Zellr.er-Geisel  procedure 
is  described  in  detail  in  Chapter  III  and  the  results  are  presented 
in  Chapter  IV. 

Model  II  with  an  MA(l)  error  structure  will  be  referred  to  as 
Model  II-MA  and  I  will  refer  to  the  Zellner-Geisel  procedure  as  the 
Z-G  procedure. 

Table  I  summarizes  the  models,  error  structures  and  procedures 
as  they  are  correctly  applied  to  each  other. 

Testing  for  Autocorrelation 

In  conjunction  with  the  comparisons  of  the  four  different 
procedures  and  OLS,  statistics  on  three  different  tests  for 
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TABLE  I 

ESTIMATION  PROCEDURES  CORRECTLY 
APPLIED  TO  MODELS  AND  ERROR  STRUCTURES 


Error  Structure 

Model 

mHa— M 

AR(l) 

Ut  =  pUt-l  +  Vt 

Beach-MacKinnon 

Wallis 

MA(l) 

Ut  =  Vt  -  Avt-1 

Pesaran 

Zellner-Geisel 

autocorrelation  were  collected.  All  three  tests  were  designed  to 
detect  the  presence  of  AR  autocorrelation  and  their  performance  when 
the  autocorrelation  was  MA(l)  was  monitored. 

For  Model  I,  a  Durbin-Watson  d  test  (1950; 1951)  wa s  performed 
after  both  the  OLS  regression  and  either  a  PES  or  B-M  regression. 

Since  both  PES  and  B-M  were  designed  to  remove  autocorrelation, 
you  would  expect  that  the  d  test  would  indicate  the  presence  of 
autocorrelation  after  OLS  but  not  after  HIS  or  B-M.  In  addition,  if 
the  d  test  indicated  the  presence  of  autocorrelation  after  an  OLS 
regression,  I  performed  a  Prais-Winston  transformation  and  regression 
procedure  (195*0  after  which  I  did  another  a  test.  The  Prais-Winston 
or  P-W  procedure  results  could  then  be  compared  to  the  OLS  and  either 

3 

I  chose  to  use  the  Prais-Winston  transformation  and  regression 
procedure  over  the  Cochrane -Orcutt  procedure  because  Prais-Winston 
retains  the  first  observation  and  Cochrane-Orcutt  does  not.  In 
addition,  both  Rao  and  Griliches  (1969)  and  Spitzer  (1979)  have  found 
the  Prais-Winston  procedure  to  be  one  of  the  best  two-stage 
procedures . 


«HHn 


? 

PES  or  B-M  results.  I  present  the  outcome  of  these  comparisons  in 
Chapter  IV . 

Similarly  for  Model  II ,  a  Durbin-Watson  d  test  was  performed 

after  both  the  QLS  and  either  Z-G  or  WA1  regressions.  But,  since  the 

Durbin-Watson  d  test  was  not  designed  to  detect  autocorrelation  in 

4 

models  with  lagged  dependent  variables,  a  Durbin  h  test  (1970)  and 
a  McNown  test  (1977)  were  also  performed  after  each  CLS  regression. 
Additionally,  a  Durbin  h  test  was  performed  after  a  WAL  regression. 

If  the  McNown  test  indicated  the  presence  of  autocorrelation  after  an 
OLS  regression,  a  P-W  type  transformation  was  performed  on  the  data 
and  another  CLS  regression  was  performed  on  the  transformed  data. 

The  outcome  of  the  comparisons  of  the  three  tests  for  autocorrelation 
and  of  the  comparisons  of  the  small  sample  properties  of  the  various 
regression  procedures  is  presented  in  Chapter  17. 

Effects  of  Misspecification 

I  was  also  interested  in  what  the  outcome  might  be  if  one  of 
the  four  specialized  procedures  was  applied  to  the  wrong  error 
structure.  For  example,  how  well  would  the  PES  procedure  remove  AB(i) 
autocorrelation  when  it  was  designed  to  handle  MA(i)  autocorrelation 
and  how  well  would  it  do  estimating  p  when  it  was  designed  to  estimate 
\ ?  This  might  occur  if  an  applied  researcher  assumed  or  specified 
that  his  error  term  was  MA(l)  and  used  PES  when  in  actuality  the 
error  term  was  AR(l). 

^See  Durbin  (l 970). 
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Table  II  shows  how  all  four  specialized  procedures  were 
incorrectly  applied  to  error  structures.  All  of  the  properties  and 
autocorrelation  test  results  that  were  compared  for  the  correctly 
applied  procedures  were  also  compared  for  the  incorrectly  applied 
ones.  These  results  are  also  summarized  in  Chapter  IV. 

TABLE  II 

ESTIMATION  PROCEDURES  INCORRECTLY 
APPLIED  TO  MODELS  AND  ERROR  STRUCTURES 


Error  Structure 

Model 

I;  Yt^Xt  +  Ut 

11 :  Yt  =  +  ^t  +  Ut 

AR(i) 

Ut  =  pVl  +  Vt 

Pesaran 

Zellner-Geisel 

H 

1 

-P 

> 

^ — 1 

2  -p 

> 

ii 

-p 

3 

Eeach-MacKi.nnon 

Wallis 

i 
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CHAPTER  II 


MODEL  I  ESTIMATION 

Model  I  is  a  simple  linear,  two  variable  econometric  model, 

v  (1) 

The  Beach-MacKinnor.  (1978)  or  B-M  procedure  is  used  to  estimate  this 
model  when  u  follows  an  AR(l)  error  structure  and  the  Pesaran  (1973) 

X 

or  PES  procedure  is  used  to  estimate  this  model  when  u^_  follows  an 
MA(l)  error  structure. 

In  this  chapter,  I  will  explain  each  procedure  in  detail  and  I 
will  discuss  the  Monte  Carlo  procedure  I  followed  in  evaluating  these 
two  procedures  against  OLS  and  Prais-Winston  or  P-W.  The  results  of 
the  evaluation  are  presented  in  Chapter  IV . 

The  Beach-MacKinnon  Procedure  for  Model  I-AE 

The  B-M  procedure  is  an  iterative  technique  which  alternates 
between  the  estimation  of  3  and  p  until  the  estimates  of  each  become 
arbitrarily  close  to  the  previous  iteration's  estimates. 

In  matrix  terms,  the  estimate  of  3»  8,  is  obtained  as  follows: 

b  =  (Z'Z)_1Z'w  (2) 

Z  =  QX 


where 


w  =  Qy 


(3) 

(4) 


10 


and 


Q  = 


(l-r2)2  0 

-r  1 

0  -r 


0  .  .  .  . 

1  0  •  • 


0 

0 

0 


0 


0  -r  1 


(5) 


For  the  first  iteration,  the  value  of  r,  the  estimate  of  p ,  is 
assumed  to  be  zero. 

To  estimate  p,  first  calculate  the  residuals  using  the  previous 
iteration's  estimate  of  g  and  the  original  X  and  Y  data,  that  is, 


e 


t 


-  bX 


t  * 


Then  calculate 


where 


¥' 


cos( 


0  +  TT 


0  =  COS 


-1 


2? 


i  a • d  2 • a2 \ 

(c  -~  +  “27') 


a2 

o.  j\2 


2- (3  -  d> 


(d  -h) 


(T  -  2).Zet*et_1 
"  (T  -  lMZe^  -  ej)' 

(T  -  l).e2  -  T»Ee2_1  -  He2 
(T  -  lMleJ^  -  e2) 

T>I:et,et-l 

(T  -  lMle2^  -  e2)' 


(6) 


(7) 


(8) 

(9) 

(10) 

(11) 


T  represents  the  number  of  observations  and  the  summations  run  from 
t  =  2  to  t  =  T. 
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The  B-M  procedure  was  programmed  in  FORTRAN  to  run  on  a  Control 


Data  Corporation  (CDC)  6400  computer.  This  program  is  available 
from  the  author  upon  request.  The  maximum  number  of  iterations  was 
set  at  ten  as  Beach  and  MacKinnon  (l9?8,p.53)  suggested  the  average 
number  of  iterations  should  run  between  four  and  seven  to  achieve 
five-digit  accuracy. 

The  Pesaran  Procedure  for  Model  I-MA 


The  PES  procedure  applies  the  simple  and  well-known  iterative 
method  of  False  Position^  to  the  first-order  condition  of  the 
concentrated  leg  likelihood  function 


4*£»Vet  "  T*(l  "  +  cos(~~)).w^e^  =  0  (12) 


where 


b  =  (Z 'WZ)_1Z  *WFy , 
e  =  Fy  -  Zb, 

Z  =  FX, 


(13) 

(14) 

(15) 


p  =  (— )’ 
VT  +  1' 


f 

f  „  . 

•  •  f* 

11 

12 

IT 

f 

f 

*  *  i'z-srr* 

21 

22 

2T 

• 

• 

• 

• 

• 

• 

• 

• 

• 

f 

f 

•  •  T 

T1 

T2 

TT 

(16) 


fij  *  sin^’ 


wt  =  {Xs  +  2»,\*cos(^~ ~)  +  1) 


i-l 


(17) 

(18) 


-1  -1 

and  W  is  the  inverse  of  the  diagonal  matrix  with  w^  ,  w^  ,  • 


^See  Hamming  (1971 .PP .45-48) . 


as  its  diagonal  elements .  T  represents  the  number  of  observations  and 


the  summations  run  from  t  =  1  to  t  =  T . 

Dr.  Pesaran  graciously  provided  a  copy  of  his  own  FORTRAN 

program  which  was  modified  slightly  in  order  to  incorporate  his  logic 

2 

into  my  Monte  Carlo  program.  Pesaran' s  own  cut-offs  were  used,  that 
is,  the  program  iterates  up  to  a  maximum  of  thirty  times  unless  the 

-4 

difference  between  successive  estimates  of  £  and  1  is  less  than  10 

The  Monte  Carlo  Procedure 


The  Procedure  in  General 

After  calculating  an  independent  data  series  of  the  appropriate 
length  using 


xt  -  6Xt-i  +  (19) 

where  0  =  0.8,  Xq  =  0.0  and  n,  NID(0,1C),  an  error  series,  u^,  was 
calculated  under  either  an  AP.(l)  or  an  MA(l )  error  structure.  I  used 
the  internal  pseudo  uniform  random  number  generator  provided  by  the 
FORTRAN  compiler  on  the  CDC  6400  computer^  and  a  normal  approximation 
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Pesaran  is  planning  to  publish  his  programs  under  the  title 
Dynamic  Regression:  Theory  and  Algorithms.  See  Pesaran  (1976). 

3 

To  validate  the  series  of  random  numbers  generated  by  the  CDC 
random  number  generator,  twenty-five  of  the  random  number  seeds  were 
chosen  at  random  from  the  280  seeds  required  to  complete  this  Monte 
Carlo  investigation.  These  twenty-five  seeds  were  then  used  to 
initiate  a  uniform  random  number  series  utilising  the  pseudo  uniform 
random  number  generator  provided  by  the  International  Mathematical 
and  Statistical  Library  (IMSL).  A  chi-square  goodness-of-fit  test 
was  run  on  each  of  the  series  initiated  by  the  twenty-five  seeds. 
Fourteen  times  out  of  the  twenty-five,  the  IMSL  series  had  a  better 
fit  than  CDC  and  eleven  times  out  of  twenty-five,  CDC  had  a  better 
series.  Additionally,  the  Monte  Carlo  runs  were  re-run  using  the 
IMSL  generator  and  the  results  compared  against  those  obtained  when 


A 
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routine  obtained  from  K.  M.  Wagner  (l9?5.P-93i0  to  generate  the  normal 
random  numbers .  A  different  random  number  seed  drawn  from  a  table  of 
random  digits  was  used  for  each  run  to  insure  different  random  number 
series . 

Using  these  X  and  u  series,  the  dependent  variable,  Y  , 

XX  X 

series  was  calculated  using  (l).  The  X  and  Y  series  then  became 

X  X 

the  data  to  be  regressed.  First  an  OLS  regression  was  calculated, 
the  estimate  of  j3  saved,  am  estimate  of  p  was  calculated  and  saved 
and  a  Durbin-Watson  d  test  was  performed  using  an  a  of  0.05.  If  the 
hypothesis  of  no  autocorrelation  was  rejected,  a  P-W  transformation 
and  estimation  procedure  was  performed,  the  new  estimates  of  =  and  p 
saved  and  a  new  Durbin-Watson  d  test  performed.  If,  following  the 
OLS  regression,  the  hypothesis  of  no  autocorrelation  was  not  rejected, 
the  P-W  estimates  for  this  replication  were  taken  to  be  the  same  as 
those  of  the  CLS  regression.  Following  this,  either  E-M  or  EES 
estimates  were  calculated  and  saved  and  another  Durbin-Watson  d  test 
performed . 

At  first,  the  computer  runs  were  made  for  the  correctly 
specified  error  structure,  that  is,  5-M  estimates  were  obtained  for 
Model  I -AH  and  PES  estimates  were  obtained  for  Model  I-MA.  Then  to 
check  the  effects  of  misspecification,  different  computer  runs  were 
made  for  the  misspecified  models;  B-M  estimates  were  obtained  fcr 
Model  I-MA  and  PE5  estimates  for  Model  I -AH. 

the  CBC  generator  was  used.  No  major  change  in  the  results  was 
observed}  the  differences  in  the  results  were  about  the  same  as  could 
be  expected  if  a  different  random  number  seed  was  used  -with  the  same 
random  number  generator. 


This  general  Monte  Carlo  procedure  was  replicated  100  times  for 
each  set  of  input  parameter  values.  This  provided  for  each  model  and 
for  each  set  of  input  parameter  values,  100  OLS  estimates,  100  B-M  or 


PES  estimates  and  100  P-W  estimates  as  well  as  the  results  of  the 
three  Durbin-Watsor.  d  tests.  The  number  of  times  the  Burbin-Watscn  d 
test  proved  inconclusive  was  collected  along  with  the  number  of  times 
the  hypothesis  of  no  autocorrelation  was  accepted  and  rejected.  The 
comparison  statistics,  discussed  later  in  this  chapter,  were 
calculated  on  the  100  estimates  collected  on  each  run. 

Input  Parameter  Values 

The  input  parameter  values  were  determined  after  examining 
recent  Monte  Carlo  studies  by  Beach  and  MacKinnon  (1978),  Rao  and 
Griliches  (1969)1  Maddala  and  Rao  (1973) >  Spitzer  (1979).  Kenkel 
(1975)  and  Hendry  and  Trivedi  (1972). 

Sample  sizes  of  20  and  50  were  selected.  The  independent 
variable  coefficient,  (3,  was  set  at  1.0  for  all  trials.  The  standard 
deviation  of  v  ,  C  ,  was  set  at  20  when  T  =  20  and  at  10  when  T  =  50. 
Ten  different  values  of  X  and  p  were  tried:  -.99.  -.8,  -.6,  -.3. 

-.1,  .1,  -3.  .6,  .8  and  .99-  There  were  then  twenty  different  sets 
of  input  parameter  values  for  both  3-M  and  PES  runs. 

Comparison  Statistics  Collected  and  Analyzed 

The  following  statistics  for  the  OLS ,  B-M  or  PES  and  P-W 
estimates  were  collected:  the  sample  mean,  the  sample  variance ,  the 
average  absolute  bias,  the  largest  and  smallest  absolute  biases,  the 
mean  squared  error  ( KSE )  and  the  Euclidian  distance  ala  Hendry  and 


Trlvedi  (1972, p. 122) ,  that  is 


E.D.  =  (H(pi 
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(20) 


where  represents  the  true  value  of  the  ith  parameter  to  be 

A 

estimated,  p^  represents  its  estimate  and  i  runs  from  i  =  1  to 
i  =  the  number  of  parameters  to  be  estimated — two  in  the  case  of 
Model  I.  The  Euclidian  distance  gives  a  measure  of  "aggregate"  bias. 

These  statistics  as  well  as  the  results  of  the  Durbin-Watson  a 
tests  were  then  output  at  the  completion  of  a  computer  run  on  a 
particular  set  of  input  parameter  values  and  a  particular  model.  The 
consolidated  results  for  all  runs  are  presented  in  Chapter  IV  along 
with  a  comparison  of  my  results  with  the  outcome  of  other  Mcr.te  Carlo 


studies  in  the  literature. 


CHAPTER  III 


MODEL  II  ESTIMATION 

Model  II  includes  a  single-period  lagged  endogenous  variable  and 
a  single  exogenous  variable 

L  ■  vt-i +  «t  *  v  (i) 

This  model  was  originally  derived  by  Koyck  (1954)  from  an  infinite 
geometric  lag  model  which  arises  when  the  adaptive  expectations  or 
partial  adjustment  models  are  used.  The  Zellner-Geisel  (1970)  or 
Z-G  procedure  was  developed  to  estimate  this  model  when  the  error 
term,  u^,  follows  an  MA(l)  structure.  Wallis  (196?)  developed  an 
estimation  procedure  to  handle  the  AR(l)  error  structure;  I  will  use 
WAL  when  referring  to  this  procedure. 

As  in  the  last  chapter,  first  I  will  describe  each  estimation 
procedure  separately  and  then  I  will  outline  the  Monte  Carlo 
procedure  I  used  to  analyze  the  small  sample  properties  of  these 
two  procedures  compared  to  those  of  OLS  and  Prais-Winston  (P-W).  The 
results  of  this  analysis  are  in  Chapter  IV 

The  Wallis  Three -Step  Procedure  for  Model  II -AR 

The  Wallis  procedure  is  basically  a  generalized  least  squares 
(GLS)  technique  that  Wallis  himself  separated  into  three  steps. ^ 

^This  procedure  is  not  to  be  confused  with  three -pass  least 


1? 


First,  estimate  the  coefficients  of  Model  II  via  OLS  but 
substitute  X^  ^  as  an  instrument  for  ^ .  Second,  using  the 
residuals  from  this  regression,  that  is, 

't  =  h  ~  Vt-1  -  Vf  <2> 

2 

calculate  an  estimate  of  p  making  a  correction  for  bias 


EVet-i 


T 


(3) 


where  T  represents  the  number  of  observations,  the  summation  in  the 
numerator  runs  from  t  =  2  to  t  =  T  and  the  summation  in  the 
denominator  runs  from  t  =  1  to  t  =  T.  Third,  use  this  estimate  of  p 
to  calculate  the  matrix 
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r 


2 


r 


r  1  r 


Q  = 


(4) 


T-l  T-2 
r  r 


1 


which  is  then  used  to  obtain  the  GLS  estimates  of  $1  and  ps 


b  =  (X,Q"iX)"1X’Q,"1y 


(5) 


squares.  In  fact,  Wallis  wrote  his  article  introducing  his  procedure 
in  response  to  a  previous  paper  by  Taylor  and  Wilson  (1964)  on 
three-pass  least  squares. 

^In  general,  the  correction  for  bias  is  k/T,  where  k  is  the 
number  of  parameters  to  be  estimated. 
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The  B-M  procedure  requires  the  least  amount  of  computer 
resources  as  it  does  not  iterate  nor  search;  only  two  least  squares 
matrix  inversions  are  required  for  each  replication. 

The  Zellner-Geisel  Procedure  for  Model  II -MA 

The  Zellner-Geisel  procedure  is  a  maximum  likelihood  (ML) 
technique  that  requires  that  Model  II  first  he  transformed.  After 
recognizing  that  in  this  model  is  really  X  and  after  defining 

wt  =  Yt  -  vt,  (6) 

you  can  obtain  the  transformed  model'' 

q  =  v‘ +  *  wt-i 

+  '^t-2  +  "  ’  *  +  4  *Xl)  +  vt‘  (?) 

Using  0L5  on  (7),  search  over  various  values  of  X  from  zero  to  one 
looking  for  the  minimum  residual  sum  of  squares.  The  program  which 
implements  the  Z-G  procedure  performs  a  grid  search  of  X  for  values 
between  zero  and  one  and  finds  to  five -digit  accuracy  the  set  of 
estimates  that  yield  the  minimum  residual  sum  of  squares. 

Both  procedures,  WAL  and  Z-G,  were  programmed  in  FORTRAN  to  run 
on  a  Control  Data  Corporation  (CDC)  6400  computer.  Both  programs  are 
available  from  the  author  upon  request. 

-^See  Johnston  (l9?2,pp.313~315)  for  the  derivation. 
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The  Monte  Carlo  Procedure 

The  Procedure  In  General 

The  data  generation  for  Model  II  is  exactly  the  same  as  for 

Model  I  except  that  after  the  calculation  of  the  series  and  the 

appropriate  u  series,  Model  II  was  used  to  generate  the  Y  series, 

X>  X, 

Yt  =  SiY^  +  g2Xt  +  ut,  (6) 


where  YQ  =  200  for  all  trials. 

As  with  Model  I ,  an  OLS  regression  was  performed  first  on  the 
generated  data  and  the  estimates  were  saved.  Using  the  residuals 
from  this  regression,  a  Durbin-Watson  d  test  was  performed  as  well  as 
a  Durbin  h  test  and  a  McNown  test. 

Durbin  (1970)  and  others  recognized  that  the  Durbin-Watson  a 
test  is  not  applicable  when  there  are  lagged  endogenous  variables 
as  regressors  in  the  model.  In  fact,  Griliches  (1961)  notes  that  the 
d  statistic  is  biased  toward  2.0  or  biased  toward  the  acceptance  of 
the  hypothesis  of  no  autocorrelation.  Both  Durbin  (1970)  and 
McNown  (1977)  have  developed  alternate  tests  to  be  used  in  place  of 
the  d  test. 

Durbin  suggests  the  use  of  his  h  test,  that  is  calculate 


(9) 


where  T  is  the  sample  size  and  r  is  an  estimate  of  p  based  on  the  OLS 
regression  residuals.  The  h  statistic  is  used  as  a  standard  normal 
deviate  to  test  the  hypothesis  of  p  =  0.  If  T»c^  >  1,  the  h  test  is 

Di 
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not  applicable  and  Durbin  suggests  an  alternate  test  for  this 
4 

situation. 

If  the  h  test  is  inapplicable,  the  following  regression  should 
be  performed  on  the  sample  residuals  using  0L3: 

e^  =  oc1e^_^  +  cc2Yt_^  +  asX^  +  yt  (l0) 

where 

et  =  Yt  "  biyt-i  -  b*xt  Cii) 

and  t  runs  from  t  =  2  to  t  =  T.  The  test  for  autocorrelation  of  u^  in 
(l)  consists  in  testing  the  significance  of  the  estimate  of  ax  in 

(io).5 

McNown  proposed  a  similar  test  to  be  used  in  place  of  the 
Durbin-Watson  d  for  lagged  endogenous  variable  models.  First  perform 
an  OLS  regression  of  the  following  model; 

Yt  =  aiYt-l  +  °*2Yt-2  +  a3Xt  +  a*Xt-l  +  6t  (12) 

where  t  runs  from  t=3tot=T.  To  test  the  hypothesis  of  no 
autocorrelation  of  u^  in  (l),  perform  a  significance  test  on  the 
estimate  of  a4  in  (12) . 

If,  based  on  the  outcome  of  the  McNown  test,  the  hypothesis  of 
no  autocorrelation  of  u^  in  (l)  was  rejected,  a  Prais-Winston  (?-Vi) 
transformation  was  performed  on  the  data  and  new  estimates  cf  px,  $2 
and  X.  or  p  were  calculated  and  saved. 

^See  Durbin  (1570). 

^Out  of  200,000  sets  of  generated  data  for  Model  II,  the 
Durbin  h  test  was  inapplicable  only  twelve  rimes. 


Finally,  either  a  Z-G  or  WAL  procedure  regression  was  performed, 
the  estimates  saved  and  a  Durbin-Watson  a  test  was  performed  using  the 


residuals  from  this  regression.  If  the  WAL  procedure  was  used,  a 
Durbin  h  test  was  performed  in  addition  to  the  d  test. 

As  with  Model  I ,  the  Monte  Carlo  procedure  just  outlined  was 
replicated  100  times  for  each  set  of  input  parameters  and  for  each 
error  structure.  Also  each  procedure,  Z-G  and  WAL,  was  applied  not 
only  to  the  error  structure  for  which  it  was  developed,  but  also  to 
the  incorrect  error  structure. 

Input  Parameter  Values 

As  stated  in  Chapter  II,  the  values  of  the  input  parameters 
were  determined  after  a  review  of  the  recent  literature.  Sample  sizes 
of  20  and  50  were  tried  as  with  Model  I .  The  coefficient  of  the 
exogenous  variable,  g2,  was  set  at  1.0  for  all  trials.  The 
coefficient  of  the  lagged  endogenous  variable,  3X,  was  set  at  either 
0.4  or  0.8  if  the  VfAL  procedure  was  being  run;  if  a  Z-G  procedure  was 
being  run,  j3x  was  set  equal  to  the  input  value  of  X.  or  p. 

The  standard  deviation  of  v  ,  0  ,  was  calculated  based  on 
signal-to-noise  ratios  of  either  10  or  100.  Maddala  and  Hao 
(1973»P*769)  derived  the  white  noise  variance,  a2,  from  the  signal- 
to-noise  ratio,  G,  when  the  error  structure  was  AR( 1 )  obtaining  the 
following; 

-  2  2 

2  -  1  +  8,Pi  .  1  -  p*3i  .  1  -  pa  .  Pa*gu 
v  1  -  1  +  P*3i  1  -  93  G 

where  9  =  0.3  and  a2  =  10.0  throughout  all  trials.  The  corresponding 
M- 


22 

formula  for  an  MA(l)  error  structure  is  as  follows: 


where  0  =  0.8  and  cJ“  =  10.0  throughout  all  trials. 

Ten  different  values  of  A.  and  p  were  tried;  -.99.  -.8,  -.6, 
”•3.  -.1,  .1,  .3.  .6,  .8  and  .99-  However,  because  the  Z-G 
procedure  searches  only  between  zero  and  one  for  the  value  of  the 
autocorrelation  parameter,  only  positive  values  of  and  p  were 
attempted  for  the  Z-G  runs. 

The  combination  of  all  these  input  parameter  values  represents 
twenty  different  sets  for  Z-G  runs  and  eighty  different  sets  for 
WAL  runs. 


y2„,2 

1  9»3x  i  _ P2  u _  1_ 

1  -  0*3!  1  -  e£  '  (1  -r  A2  -  2.\.pJ  ’  G 


Comparison  Statistics  Collected  and  Analyzed 

The  same  comparison  statistics  were  collected  and  analysed  for 
Model  II  as  were  collected  and  analyzed  for  Model  I .  The  results  of 
my  analysis  are  presented  and  discussed  in  the  next  chapter  along 
with  comparisons  of  my  results  to  those  found  in  the  literature. 
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CHAPTER  IV 


RESULTS  AMD  CONCLUSIONS 

In  this  chapter,  the  estimated  parameter  comparison  statistics 
results  are  presented  before  the  results  of  the  autocorrelation 
detection  tests.  Within  each  section  of  results,  Model  I  will  be 
discussed  before  Model  II .  But  before  proceeding  to  the  results  an 
explanation  of  Table  III  through  Table  XVIII  is  required. 

Table  III  throught  Table  XVIII  contain  the  percentage 
differences  between  the  estimated  parameter  comparison  statistics  of 
two  regression  procedures  for  two  or  three  parameters.  For  each 
parameter  that  was  estimated,  the  percentage  differences  for  bias, 
variance  and  mean  squared  error  (MSS)  are  displayed  along  -with  the 
percentage  difference  for  the  Euclidian  distance  (E.D.).X  Each  row  of 
a  table  represents  a  different  set  of  input  parameter  values. 

The  percentage  difference  was  calculated  using  this  formula: 

IPb|  "  I?  I 

Percentage  Difference  =  - : - : -  •  ICO  (l) 

lpbl 

where  p^  represents  the  estimated  parameter  comparison  statistic  for 
the  second  procedure  listed  in  the  title  of  the  table  and  p^ 

^"Although  the  largest  absolute  bias  and  the  smallest  absolute 
bias  statistics  were  collected  and  analyzed,  they  added  little  cr 
nothing  to  the  understanding  of  the  problem.  Consequently ,  these 
results  are  not  included  in  the  tables. 


represents  the  estimated  parameter  comparison  statistic  for  the  first 
procedure  listed.  For  example,  the  first  procedure  listed  in  Table 
III  is  Pesaran  and  OLS  is  the  second. 

The  sign  cf  the  percentage  difference  gives  a  qualitative 
comparison  between  the  two  procedures  and  the  absolute  value  of  the 
percentage  difference  gives  a  quantitative  comparison  between  the  two. 
If  the  percentage  difference  is  negative,  the  estimated  parameter 
comparison  statistic  for  the  second  procedure  was  less  than  that  for 
the  first  procedure  and  the  larger  the  absolute  value  of  the 
percentage  difference,  the  greater  the  difference  between  the  two 
procedures.  On  the  other  hand,  if  the  percentage  difference  is 
positive,  the  first  procedure  was  better  than  the  second,  but  a  word 
of  caution  is  in  order  in  analyzing  the  quantitative  difference 
because  the  percentage  difference  is  not  symmetric  about  zero.  When 
the  percentage  difference  is  positive,  the  largest  value  possible  is 
100  due  to  the  way  in  which  it  is  calculated.  A  percentage  difference 
of  50  is  comparable  to  a  negative  percentage  difference  of  -100;  a 
percentage  difference  of  75  is  comparable  to  a  negative  percentage 
difference  of  -300;  88  is  comparable  to  -700;  90  to  -900;  95  to  -1300 
through  -2100;  98  to  -3900  through  -5900;  ar.d  a  percentage  difference 
of  100  is  comparable  to  any  negative  percentage  difference  less  than 
-19,900. 

Comparison  Statistic  Results  for  Model  I 

Since  Model  I  is  a  simple  linear  regression  model,  OLS  estimates 
•will  be  unbiased  but  not  asymptotically  efficient  when  the  error  term 
is  autoccrrelated .  We  are  therefore  most  interested  in  the  small 
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sample  variance  but  we  cannot  overlook  the  bias  in  these  small 
samples.  The  results  for  EES  and  3-M  applied  to  their  correct  error 
structure  will  be  discussed  before  the  results  of  the  incorrectly 
applied  error  structures. 

Pesaran  Procedure  on  MA(l) 

The  results  on  the  Pesaran  procedure  are  mixed  but  in  general 
both  OLS  and  P-W  seemed  to  be  slightly  better.  Looking  at  Table  III 
and  Table  IV,  more  negative  percentage  differences  can  be  seen  than 
positive  ones.  The  percentage  differences  for  X  are  nearly  all 
negative  meaning  that  OLS  and  P-W  were  almost  always  better  than  EES; 
this  is  quite  surprising  considering  that  EES  is  truly  estimating  X 
while  OLS  and  P-W  both  estimated  p  as  a  proxy  for  If  PES  had  any 
edge  over  OLS  and  P-W  on  the  estimation  of  X,  it  would  be  in  the  area 
of  X's  variance  when  X  was  large,  either  negative  or  positive. 

PES  showed  up  better  in  the  estimation  of  3  where  for  both 
large  positive  and  negative  values  of  X,  PES  had  some  rather  large 
positive  percentage  differences. 

The  most  disturbing  result  of  the  EES  runs  on  Model  I-MA  was 

that  all  three  procedures  estimated  the  wrong  sign  on  X  for  all 

2 

twenty  sets  of  input  parameter  values.  OLS  and  P-W  were  really 
estimating  o  and  that  may  well  be  the  reason  for  these  two  procedures 
obtaining  the  wrong  sign,  but  the  EES  procedure  is  designed  to 
estimate  X  and  the  fact  that  it  estimated  the  wrong  sign  makes  this 
procedure  suspect. 

^This  result  cannot  be  seen  from  the  tables. 
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Since  the  PES  procedure  uses  the  False  Position  algorithm  as 
mentioned  in  Chapter  II  and  this  algorithm  approaches  the  root  of  the 
function  from  both  sides  of  zero,  it  is  possible  that  the  algorithm 
may  not  work  and  therefore  the  PES  procedure  cannot  be  performed.  The 
program  that  performed  the  PES  procedure  kept  track  of  the  number  of 
times  it  was  impossible  to  perform  the  PES  procedure  because  of  the 
failure  of  the  algorithm.  In  no  case  on  the  PES  runs  on  Model  I-MA 
did  the  algorithm  experience  a  failure. 

Likewise,  the  P-W  procedure  requires  the  taking  of  a  square  root 
and  applying  it  to  the  first  observation.  Therefore,  it  is  quite 
possible  that  the  value  under  the  square  root  radical  could  be  nega¬ 
tive  making  it  impossible  to  perform  P-W.  All  four  procedure  programs 
kept  track  of  the  number  of  times  it  was  impossible  to  perform  P-W  due 

to  a  negative  value  under  the  square  root  radical .  During  the  PES 

runs  on  Model  I-MA,  this  problem  never  occurred. 

As  mentioned  in  Chapter  II,  Pesaran’s  own  cut-offs  of  thirty 
iterations  or  parameter  differences  of  less  than  10  were  used.  The 
PES  procedure  reached  the  thirty  iteration  cut-off  rarely  when  T  was 

50  but  when  T  was  20,  the  thirty  iteration  cut-off  was  reached 

frequently  with  low  values  of  \  and  almost  every  time  with  values  of 
near  ±  1 . 

Some  reluctance  should  accompany  the  use  of  the  PE'S  procedure 
to  estimate  A  and  p  for  Model  I-MA,  A  tetter  alternative  appears  to 
be  that  cf  ?-W  since  on  all  comparison  statistics  it  was  better  than 
CL3 .  The  P-W  estimate  of  p  would  have  to  be  used  as  the  estimate  for 
X  but  as  can  be  seen  from  the  tables,  the  P-W  estimate  of  p  does  r.ct 


appear  to  be  a  poor  proxy  for  \ . 
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Beacn-MacKinnon  Procedure  on  AR(l) 

Rao  and  Griliches  (1969)  found  that  non-linear  maximum  like¬ 
lihood  procedures  like  Beach-MacKirmon  are  no  improvement  over  simpler 
two-stage  procedures  like  Prais-Winston  in  samples  of  the  same  size. 
But  Spitzer  (1979)  In  trying  to  duplicate  the  work  of  Rao  and 
Griliches  came  to  the  opposite  conclusion.  My  results  tend  to  favor 
Spitzer. 

Looking  at  Table  V,  the  percentage  differences  for  p  are  rather 
small  and  that  for  large  positive  and  negative  values  of  p,  the  E-K 
variance  is  less  than  that  for  OLS .  Bias  favors  OLS  when  p  is 
negative  but  B-M  had  the  lower  bias  for  positive  values  of  p .  The 
results  for  MSS  are  very  close  to  those  for  variance. 

As  for  ;3,  the  percentage  differences  are  larger  than  for  p  ar.d 
all  three  comparison  statistics  seem  to  indicate  that  B-M  is  better 
than  OLS  when  p  is  large.  The  Euclidian  distance  also  tends  to  favor 
B-M  when  p  is  near  ±  1 ,  especially  for  positive  values  > 

Table  VI  compares  B-M  to  P-W  and  the  results  are  generally  the 
same  as  for  CL3  except  that  the  percentage  differences  tend  to  be 
larger,  especially  for  the  estimation  of  P.  For  negative  p,  there 
is  no  clear  winner  on  the  bias  of  3 • 

A  surprising  outcome  cf  the  3-M  runs  on  Model  I -AH  was  than  OLS 
tended  to  do  a  better  job  in  terms  of  Euclidian  distance  than  ?-W .  In 
terms  of  the  other  comparison  statistics,  P-W  did  do  a  better  jcb  than 
OLS  on  the  estimation  of  3;  OLS  came  out  better  on  the  estimation  cf 
p .  CLS  had  some  very  large  sample  variances  on  the  estimate  of  p 
especially  when  |p|  was  large  ar.d  this  accounted  for  the  P-W  showing. 


Due  to  negative  values  under  the  square  root  radical,  ?-W 


estimates  could  not  be  calculated  when  |p|  =  .99  on  several  sets  of 
data  although  for  far  less  than  half  of  the  number  of  sets  of  data 
generated  and  for  far  fewer  times  when  T  was  50. 

Since  there  are  two  different  square  roots  to  be  taken  in  the 
B-M  procedure — see  (5)  and  (7)  in  Chapter  II — and  there  is  the  chance 
that  the  procedure  might  try  to  take  the  square  root  of  a  negative 
number,  it  is  possible  that  the  3-M  procedure  cannot  be  performed. 

The  B-M  procedure  program  kept  track  of  the  number  of  times  it  was 
impossible  to  perform  a  3-M  procedure  due  to  negative  values  under  the 
square  root  radical.  In  no  case  was  it  impossible  to  calculate  3-M 
estimates  on  Model  I -AH. 

The  ten  iteration  cut-off  was  reached  only  rarely  but  when  it 
was,  it  was  more  likely  that  the  input  value  of  p  was  near  zero. 

In  general,  the  B-M  procedure  appears  to  be  quite  reliable  for 
the  estimation  of  Model  I -AH. 


Beach-MacKinr.on  Procedure  or,  MA(l) 

The  Beach-MacKinnon  procedure  does  a  job  nearly  comparable  to 

CLS  on  the  estimation  of  X  but  it  is  inferior  to  ?~W  cn  the  estimation 

of  ,\.  The  estimation  cf  3  tends  to  favor  B-M  ever  booh  OL5  and  ?-iv. 

3 

Table  VII  presents  the  results  of  B-M  versus  CLS  and  although 
there  axe  many  negative  percentage  differences  for  the  three 
comparison  statistics  for  X,  only  a  few  are  in  double  digits  ar.d  many 
are  zero.  So  technically  OLS  is  the  winner  but  r.ot  by  much.  As  for 

''Since  3-M  is  really  trying  to  estimate  p,  all  three  procedures 
are  estimating  p  as  a  proxy  for  X;  hence  the  p/\  label  in  Table  VII 
and  Table  VIII . 
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the  estimation  of  p,  the  percentage  differences  are  larger  and  many  of 
them  are  positive  indicating  that  B-M  was  better  than  0L3 .  The  bias 
column  seems  to  be  sprinkled  with  negative  values  in  a  random  fashion 
but  for  positive  and  large  negative  values  of  X,  3-M  had  the  edge  over 
OLS .  The  Euclidian  distance,  which  is  an  "aggregate"  bias,  indicates 
that  there  was  little  difference  between  B-M  and  013. 

Table  VIII  has  far  more  negative  percentage  differences  ar.d  the 
values  are  much  larger  especially  for  X.  and  the  Euclidian  distance. 

The  results  on  ,8  are  roughly  the  same  as  for  B-M  versus  OLS . 

At  no  time  was  it  impossible  to  do  a  B-M  procedure  or  a  P-W 
procedure  because  of  a  negative  value  under  the  square  root  radical. 
The  average  number  of  iterations  required  for  a  3-M  regression  was 
slightly  higher  with  the  smaller  sample  size  and  the  average  fell  as 
the  value  of  X  grew.  The  ten  iteration  cut-off  was  reached  more  often 
with  a  negative  X  than  with  a  positive  one. 

As  with  the  EES  runs  on  Model  I-MA,  P-W  came  out  the  clear 
winner  against  OLS;  all  comparison  statistics  favored  P-W. 

Again,  however,  all  three  procedures  estimated  the  wrong  sign 
on  X  for  all  possible  sets  of  data.  I  believe  the  reason  for  the 
wrong  sign  estimate  is  that  the  form  of  an  MA(l)  error  structure — (c) 
in  Chapter  I — has  a  -X  whereas  the  form  of  an  AR ( 1 )  error  structure  — 
(b)  in  Chapter  I — has  a  positive  p.  In  ether  words,  the  regression 
procedures  are  trying  to  estimate  -X • 

Pesaran  Procedure  on  A3(l) 

The  Pesaran  procedure  turns  cut  to  be  clearly  inferior  to  CL3 
in  terms  of  the  estimation  cf  o;  as  for  the  comparison  cf  fl'f  or.:;  ill 


on  3  and  for  PES  versus  P-W,  the  results  are  mixed.  These  results 
are  displayed  in  Table  LX  and  in  Table  X . 
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On  the  estimation  of  3  ,  the  Pesaran  procedure  appears  to  have  ar. 
edge  over  CLS  in  terms  of  variance  and  MSE  when  0  is  near  one;  in  the 
cases  of  bias  and  Euclidian  distance,  PES  seemed  better  than  0L3 
when  p  was  positive. 

Comparing  PES  to  P-W,  you  can  see  almost  the  opposite  results 
to  PES  versus  OLS.  PES  was  better  than  P-W  in  the  estimation  of  p  for 
large  values  of  p;  this  was  also  true  for  Euclidian  distance.  In 
terms  of  the  estimation  of  3 ,  P-W  appeared  to  be  better  than  PES  when 
p  >  0.6. 

In  results  not  tabulated,  PES  and  OLS  almost  consistently 
overestimated  3  when  T  was  set  at  50.  At  no  time  was  it  impossible  to 
perform  a  PES  procedure  but  when  J p|  -  .  99>  the  P-W  procedure  couldn't 
be  performed  10-20  times  out  of  100  due  to  negative  values  under  the 
square  root  radical.  The  thirty  iteration  cut-off  on  the  PES 
procedure  was  reached  more  often  for  large  values  of  p  ar.d  for  T  =  20. 

Comparing  OLS  and  P-W,  P-W  was  only  slightly  tetter  than  OLS  on 
the  variance  of  both  p  and  3 •  On  bias  and  MSE ,  P-W  was  generally 
better  than  OLS  on  the  estimation  of  3  ,  while  OLS  was  better  on  p .  In 
addition,  OLS  was  generally  better  than  P-W  in  terms  of  Euclidian 
distance . 

The  fact  that  P-W  did  not  prove  superior  to  CLS  on  Model  I -AH 
still  concerns  me  and  I  am  unable  to  account  for  it.  Both  Rao  and 
Griliches  (1969)  and  Spitzer  (1979)  found  P-W  to  be  one  of  the  best 
two-stage  procedures  cut  I  am  disappointed  in  its  performance  when  the 
error  structure  was  AR(l).  However,  P-W  did  seem  to  remove  the  ARfi) 
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autocorrelation  as  will  be  shown  later  in  this  chapter  ar.d  it  did. 
seem  to  do  a  better  job  than  OLS  when  the  error  structure  was  MA(l). 

Before  ending  this  section  on  Model  I,  I  think  it  would  be 
instructive  to  repeat  the  results  concerning  whether  it  was  possible 
to  perform  a  P-W  procedure  remembering  that  if  (l  -  rz)  <  0,  a  ?-W 
transformation  cannot  be  performed.  In  no  case  under  an  MA(l)  error 
structure  was  it  impossible  to  perform  P-W  and  under  an  AR(l)  error 
structure  only  large  values  of  p  caused  the  value  under  the  square 
root  radical  to  become  negative  and  thereby  make  it  impossible  to 
perform  P-W.  This  suggests  that  if  a  P-W  procedure  can't  be 
performed,  it  is  quite  likely  that  the  error  structure  is  not  MA(l). 

Comparison  Statistics  Results  for  Model  II 

Model  II  is  a  single-period  lagged  endogenous  variable  and 
single  exogenous  variable  econometric  model.  If  the  error  term  is 
autocorrelated ,  QL3  will  provide  biased,  inconsistent  and  inefficient 
estimates.  The  small  sample  bias,  therefore,  is  the  comparison 
statistic  of  most  interest  but  the  other  comparison  statistics  provide 
noteworthy  information  as  well.  The  results  for  Z-C-  ana  WAL  applied 
to  their  correct  error  structures  will  be  discussed  first,  then  the 
results  of  the  incorrectly  applied  error  structures  will  be  presented. 

Zellner-Teisel  Procedure  or.  MA(l) 

The  Zellner-Geisel  procedure  would  appear  to  be  the  procedure  of 
choice  when  the  value  of  a.  is  near  one .  Table  XI  displays  mostly 
positive  percentage  differences  when  X  >  0.6  and  Table  XII  is  nearly 
all  positive  percentage  differences.  In  addition,  the  percentage 
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differences  arrayed  in  Table  XII,  which  compares  Z-G  to  P-W,  are 
almost  all  90  or  above  indicating  that  Z-G  was  clearly  better  than 
P-W  based  on  the  outcome  of  the  McNown  test . 

The  large  negative  percentage  differences  for  the  estimation  of 
X  in  Table  XI  for  low  values  of  X  is  quite  consistent  with  Morrison 
(l9?0).  He  found  that  OLS  was  good  for  small  A.  and  small  signal-to- 
noise  ratios  but  that  it  degenerated  rapidly  as  X  and  the  signal-to- 
noise  ratio  got  larger.  He  also  felt  that  maximum  likelihood  leaning 
techniques  gave  the  best  estimates. 

Morrison  reported  that  the  degeneration  of  the  OLS  estimates  as 
X  and  the  signal-to-noise  ratio  increased  was  due  to  the  fact  that  the 
OLS  estimates  displayed  a  strongly  negative  bias .  OLS  did  provide 
estimates  of  X  that  almost  consistently  underestimated  the  true  value 
of  X.  However,  OLS  did  not  consistently  underestimate  pE.  P-W  did 
underestimate  X  more  often  than  not  and  P-W  far  underestimated  in 
all  cases  although  the  bias  was  less  as  X  and  the  signal-to-noise 
increased.  Morrison's  results  and  mine  do  conflict  with  Griliches 
(l96l)  who  concluded  that  as  long  as  in  Model  II  is  positive,^ 

OLS  will  overestimate  it.  Under  an  AR(l)  error  structure,  CLS 
overestimated  during  the  Z-G  runs  eighteen  times  out  of  twenty  but 
during  the  WAL  runs  on  both  MA(l)  and  AR(l)  error  structures,  CLS 
more  often  than  not  underestimated  px. 

As  could  be  discerned  from  the  above  discussion,  OLS  was 
clearly  superior  to  P-W;  this  superiority  was  evident  in  all 
comparison  statistics.  In  other  ncn-tabulated  results,  the  bias, 

Recall  that  for  Model  II-MA,  3 r  and  X  are  the  same. 
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variance,  MSE  and  the  Euclidian  distance  statistics  tended  to  fall  for 


both  parameter  estimates  and  for  all  procedures  as  the  signal -to-noise 
ratio  increased.  This  last  result  is  not  unexpected. 

In  his  analysis  of  distributed  lag  estimators,  Sargent  (1963) 
concluded  that  there  was  not  much  difference  between  special 
procedures  and  DLS .  My  results  tend  to  point  to  a  different 
conclusion:  Z-G  is  a  useful  procedure  especially  when  k  is  large. 

Wallis  Procedure  on  AR(l) 

OLS  proved  to  be  better  than  WAL  for  all  three  parameter 
estimates  for  all  values  of  p  and  except  when  p  >  0.3  and  px  =  0.8. 
WAL  proved  better  than  P-W  for  positive  values  of  p. 

The  results  of  the  comparison  between  WAL  and  OLS  are  provided 
in  Table  XIII  which  had  to  be  continued  on  two  additional  pages 
because  of  the  eighty  possible  sets  of  input  parameter  values.  The 
results  for  WAL  versus  P-W  can  be  found  in  Table  XIY  which  is  also 
three  pages  long. 

As  can  be  seen  in  Table  XIII ,  the  large  negative  percentage 
differences  for  the  comparison  statistics  for  the  estimate  of  C  and 
for  the  Euclidian  distance  when  the  value  of  p  is  negative  indicate 
that  OLS  was  better  than  WAL.  Although  not  all  the  comparison 
statistics  for  the  estimates  of  px  and  have  negative  percentage 
differences  nor  are  the  differences  as  great  as  those  for  the 
estimate  of  p,  most  of  the  percentage  differences  are  negative.  For 
negative  values  of  P,  CLS  is  the  choice  ever  WAL. 

The  CLS  superiority  continues  through  small  positive  values  of 
P  when  for  values  of  P  >  0.3,  more  and  more  of  the  percentage 
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differences  are  positive  indicating  that  WAL  was  better  than  OLS . 

But  notice  that  even  for  large  positive  values  of  p,  when  =  0.4, 
the  percentage  differences  for  the  estimate  of  p  are  negative  or 
favoring  OLS  over  WAL. 

As  for  WAL  versus  P-W,  WAL  is  favored  on  the  estimate  of  3i  for 
nearly  every  set  of  input  parameter  values.  WAL  is  also  favored  on 
the  estimate  of  for  values  of  p  >  -0.6  but  WAL  is  favored  over  OLS 
on  the  estimate  of  p  only  for  p  >  0.3* 

In  non- tabulated  results,  WAL  overestimated  p  when  p  <  0.6;  WAL 
underestimated  3i  sixty  times  out  of  eighty  and  nearly  every  time  that 
=  0.4.  Additionally,  WAL  estimated  a  positive  p  in  twenty-six 
times  out  of  forty  when  p  was  negative  but  every  time  the  true  p  was 
positive,  WAL  estimated  it  as  positive.  P-W  underestimated  in 
nearly  every  case  but  both  OLS  and  P-W  always  obtained  the  correct 
sign  on  the  estimate  of  p. 

Concerning  OLS  versus  P-W,  in  all  but  the  variance  of  the 
estimate  of  p,  OLS  was  better  than  P-W.  The  variance  of  the  estimate 
of  p  favored  P-W  slightly  over  CLS  for  mid-range  values  of  p,  but  for 
large  positive  and  negative  values  of  p ,  OLS  was  better  than  ?-W  even 
for  the  variance  of  the  estimate  of  p . 

Although  the  signal-to-noise  ratio  had  no  appreciable  effect  on 


the  estimate  of  p,  the  bias,  variance  and  MSE  for  the  estimates  of 
31  and  3 2  tended  to  fall  for  all  procedures  when  the  value  of  the 
signal-to-noise  raoio  went  from  10  to  100. 
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Wallis  Procedure  on  MA(l) 

The  results  for  this  misspecified  model  axe  not  very  cleax-cut. 
Both  the  bias  on  the  estimate  of  X  and  the  Euclidian  distance  favor 
OLS  over  WAL  for  negative  values  of  X  but  they  favor  WAL  over  OLS  for 
positive  A..  Otherwise,  the  results  are  mixed. 

Table  XV  and  Table  XVI  provide  the  results  on  the  WAL  runs  on 
Model  II-MA.  The  value  of  had  an  important  effect  on  the  results. 
When  =  0.4,  WAL  is  favored  over  OLS  either  by  an  outright  positive 
percentage  difference  or  by  a  reduction  in  the  size  of  the  negative 
percentage  difference  when  px  =  0.8.  This  result  is  most  marked  when 
X  is  negative  and  can  be  seen  in  the  comparison  statistics  for  all 
three  parameter  estimates  as  well  as  for  the  Euclidian  distance. 

Comparing  WAL  and  P-W,  WAL  comes  out  the  clear  choice  in  terms 
of  the  estimates  of  3i  and  g2  as  nearly  every  percentage  difference 
for  these  two  parameter  estimates  is  95  on  above.  As  for  the 
estimate  of  X ,  the  variance  tended  to  favor  P-W  through  all  values  of 
X  but  both  the  bias  and  the  MSE  favored  P-W  for  negative  values  of  X 
while  WAL  was  better  for  positive  values  of  X .  The  Euclidian  distance 
statistic  favored  P-W  for  X  <  -0.3  but  favored  WAL  for  X  >  -0.6. 

As  was  the  case  with  Model  I-MA,  the  CLS  and  P-W  procedures 
estimated  the  wrong  sign  on  X  in  all  cases  for  the  WAL  runs  or. 

Model  II-MA.  The  WAL  procedure  in  all  but  nine  cases  estimated  a 
positive  X;  the  nine  cases  where  WAL  estimated  a  negative  X  occurred 
when  the  true  value  of  X  >  0.3  and  when  T  was  50. 

The  OLS  versus  P-W  comparison  yeilded  mixed  results.  The  bias, 
variance  and  MSE  on  the  estimate  of  X  favored  P-W  tut  OLS  was  almost 
consistently  tetter  than  P-W  on  the  estimation  of  31  and  ps.  As  far 


as  the  Euclidian  distance,  OLS  tended  to  be  better  than  P-W  fcr 


mid-range  values  of  A.  but  for  large  positive  and  negative  values  of  ). , 
?-W  was  generally  better  than  OLS . 

The  signal-to-noise  ratio  had  an  effect  on  the  pi ' s  once  again. 
The  bias,  variance  and  MSE  on  the  estimates  of  the  3's  tended  to 
become  smaller  as  the  signal-to-noise  ratio  increased. 

Zellner-Geisel  Procedure  on  Ap(l) 

This  misspecified  error  structure  model  yielded  quite  clear 
results.  OLS  did  a  better  job  than  Z-G  and  Z-G  was  generally  much 
better  than  P-W . 

Table  XVII  provides  the  results  of  the  comparison  between  Z-G 
and  OLS  and  there  are  far  more  negative  percentage  differences  than 
positive  ones.  The  verdict  is  not  quite  unanimous  but  I  would  choose 
OLS  over  Z -G . 

Table  XVIII  provides  the  results  of  the  comparison  between  Z-G 
and  P-W.  There  are  some  very  large  positive  percentage  differences  in 
this  table  but  the  largest  negative  percentage  differences  I  found  in 
the  eight  different  sets  of  runs  I  made  can  be  seen  under  the  variance 
and  MSE  columns  for  32  when  p  =  .99*  The  Z-G  procedure  calculated 
estimates  for  3s  when  p  was  large  that  yielded  enormous  variances; 
for  example,  when  T  was  50.  the  signal-to-noise  ratio  was  10  and  p 
was  .99,  the  Z-<G  procedure  obtained  a  sample  variar.ee  for  the  estimate 
of  p2  that  was  equal  to  3145*33  as  compared  to  CLS's  44.29  ana  P-W's 
1.96.  The  exact  reason  for  this  is  unknown  but  it  may  well  be  that 
the  interaction  between  the  values  of  the  input  parameters  and  the 
grid  searcr.  on  X  yielded  some  local  instead  of  global  minimums. 
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In  non- tabulated  results,  both  0L3  and  ?-W  almost  consistently 
overestimated  p  while  ?-V  almost  consistently  overestimated  32  as 
well,  OLS  was  generally  better  than  P-W  in  all  comparison  statistics 
for  both  parameter  estimates.  Once  again,  as  the  signal-to-noise 
ratio  increased,  the  bias,  variance  and  MSE  for  both  parameter 
estimates  tended  to  fall. 

Autocorrelation  Test  Results  for  Model  I 
IiA(l)  Error  Structure 

The  hypothesis  tested  by  the  Durbin-VIatson  d  test,  namely 
Hq!  p  =  0,  does  not  strictly  apply  to  the  case  where  the  error 
structure  is  MA(l)  but  since  the  d  statistic  is  calculated  and 
printed  by  most  canned,  regression  programs,  the  outcome  of  the  use  of 
this  test  under  the  MA ( 1 )  error  structure  is  quite  remarkable. 

Table  XIX  arrays  the  percentage  of  time  the  hypothesis  of  r.c  A?.(l) 
autocorrelation  was  accepted  and  rejected  fcr  the  various  input 
parameter  value  sets  for  both  the  RES  and  3-M  runs  on  Model  I-MA. 

Sven  though  the  d  test  is  r.ot  applicable  to  an  MA(l)  error 
structure,  Table  XIX  exhibits  a  typical  power  function.  At  values  of 
\  near  1.0  and  -1.0,  the  hypothesis  was  accepted  after  an  QLS 
regression  only  a  few  times  especially  when  T  =  jC.  At  values  of  A. 
near  zero,  the  hypothesis  was  accepted  the  majority  of  the  time. 

-'The  number  of  times  the  Durbin-Watscn  i  test  proved  to  be 
inconclusive  was  collected  but  was  not  tabulated.  The  inconclusive 
percentage  can  ce  obtained  by  adding  together  the  percentages  of 
acceptances  and  rejections  and  subtracting  the  total  from  100. 
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This  power  of  the  a  test  to  detect  MA(l)  autocorrelation  has 
been  reported  before.  Smith  (1976)  found  the  d  test  was  satisfactory 
and  uniformly  better  than  Durbin's  periodogram  (1969).  Geary's  sign 
test  (1970)  and  Schmidt's  dx  +  da  test  (1972).  Also  Elattterg  (1973) 
found  the  power  of  the  Durbin-Watson  d  test  quite  satisfactory  for 
MA(l)  autocorrelation. 

Even  though  both  Fitts  (1973)  and  Godfrey  (1978)  have  developed 
tests  for  MA(l)  autocorrelation,  it  is  not  poor  policy  to  rely  on  the 
d  test.  Fomby  and  Guilkey  (1978)  suggested  that  for  AF.(l)  autocorre¬ 
lation  detection  an  a  level  of  O.5O  or  better  should  be  used.  If  a 
higher  a  level  were  used  for  testing  for  MA(l)  autocorrelation,  the  d 
test  would  likely  be  quite  adequate. 

The  Pesaran  procedure  was  developed  to  handle  Model  I-KA  and 
looking  at  Table  XIX,  this  procedure  has  apparently  removed  the  MA(i) 
autocorrelation  from  the  residuals  as  car.  be  seen  by  the  acceptance 
percentages  being  at  least  66.  But  there  is  hardly  any  difference 
between  the  FES  and  2-M  columns  of  the  table,  that  is  the  r-M 
procedure  seemed  to  also  remove  the  MA ( 1 )  autocorrelation.  In 
addition,  both  of  these  procedures  appeared  to  do  a  better  job  than 
Prais-Winston  at  removing  the  autocorrelation. 

AR(l)  Error  Structure 

The  results  for  the  Durbin-Watson  d  test  for  Model  I -AT.  are 
given  in  Table  XX  which  was  arrayed  exactly  like  Table  XIX  to 
facilitate  comparison  of  the  two  tables . 

As  can  be  seen  by  looking  at  the  Beach-MacKinncr.  cr  right  hand 
typical  power  fur.cti  _r.  was  obtained  when  the  i 


side  of  Table  XX ,  a 
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test  was  applied  after  ar.  CL3  regression:  low  acceptance/high 
rejection  for  values  of  p  near  1.0  and  -1.0  and  high  acceptance/low 
rejection  percentages  for  p  values  near  zero.  As  for  the  B-M  and 
P-W  columns,  the  expected  result  is  also  displayed.  Since  both  B-M 
and  P-W  are  designed  to  handle  AR(l)  autocorrelation,  the  a  test 
acceptance  rate  should  be  quite  high  regardless  of  the  value  of  p . 
This  is  what  can  be  seen  cut  B-M  seemed  to  do  a  better  job  than  P-W 
especially  when  p  =  .99* 

As  is  obvious  from  the  acceptance  and  rejection  percentages 
under  the  PES  column,  it  is  evident  that  with  high  negative  and 
positive  values  of  p ,  the  PES  procedure  did  not  do  an  acceptable  job 
of  removing  AP.(l)  autocorrelation.^ 

Autocorrelation  Test  Results  for  Model  II 
MA(l)  Error  Structure 

Again  with  the  understanding  that  these  tests  were  not  designed 
to  detect  MA(l)  autocorrelation,  Tables  XXI  and  XXII  show  the  results 
of  using  the  Durbin-W'atson  d,  the  Me  Mown  and  the  Durbin  h  tests  on 
MA(l)  autocorrelation.  Table  XXI  shows  the  percentage  of  acceptances 
and  rejections  for  the  Z-G  runs  or.  Model  II -MA;  Table  XXII  shows  the 
same  for  the  WAL  runs . 

The  McNown  test  administered  after  an  GL3  regression  appears  to 
be  unreliable  for  small  samples.  It  did  a  better  job  when  T  =  and 

The  zero  acceptance,-  zero  rejection  for  ?-.<  under  the  Pesarar. 
runs  for  T  =  20  and  p  =  -0.1  ana  the  triple  dashes  under  the  Beach- 
MacXiuror.  runs  is  due  to  the  fact  that  a  P-W  procedure  was  performed 
only  if  there  was  a  rejection  by  the  Durbin-Watscn  d  test. 
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for  the  larger  sigr.al-to-noise  ratio — this  result  is  especially  clear 
in  Table  XXII --but  both  the  h  and  d  tests  appear  to  have  been  more 
powerful.  The  McNown  test  also  exhibits  an  interesting  anomaly  when 
k  nears  the  value  of  one  in  Table  XXI,  The  percentage  of  rejections 
tends  to  show  an  increasing  trend  and  the  percentage  of  acceptances, 
a  decreasing  trend  as  X  increases  from  a  value  of  0.1  but  these  trends 
are  markedly  broker,  when  the  value  of  k  =  .99*  This  anomaly  shows  up 
for  the  Z-G  runs  but  it  is  absent  in  the  WAL  runs.  It  appears  again, 
interestingly  enough,  in  the  Z-G  runs  on  Model  II -AR  to  be  discussed 
later  in  this  chapter.  Why  it  appears  in  the  Z-G  runs  and  not  in  the 
WAL  runs  is  unclear  to  me;  the  McNown  test  was  administered  only 
after  an  OLS  regression  and  since  the  data  generation  was  the  same 
for  either  the  Z-G  or  WAL  runs  for  a  given  error  structure,  the 
anomaly  should  have  appeared  under  both  runs.  Neither  the  Z-G  nor  the 
WAL  procedure  should  have  had  any  effect  on  the  outcome  of  the  McNowr. 
test . 

The  Durbin  h  test  administered  after  an  OLS  regression  appears 
to  be  reliable;  it  is  more  powerful  for  the  larger  sample  size,  this 
being  especially  evident  for  |,\|  >  0.3  in  both  tables. 

The  Durbin-Watsor.  d  test  again  appears  reasonably  powerful  in 
detecting  MA(l)  autocorrelation.  If  you  consider  the  percentage  of 
ir.conclusives — not  arrayed  in  the  tables — as  rejections,  the  d  nest 
would  be  more  powerful  than  Turbin's  h. 

Looking  at  the  d  test  results  after  a  Z-G  regression,  the  Z-G 
procedure  seemed  to  do  a  passable  job  at  removing  the  XA(i) 
autocorrelation.  Most  cf  the  rejection  percentages  are  below  five 


-.d  all  are  below  ten. 


And  in  all  cases  the  acceptance  percentages 
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are  higher  after  a  Z-G  procedure  regression  than  after  ar.  OLS 
regression.  The  d  test  results  after  a  WAL  procedure  regression 
indicate  that  the  WAL  procedure  did  not  do  an  acceptable  job  of 
removing  MA(l)  autocorrelation.  In  fact,  the  results  seen  to  parallel 
rather  closely  the  outcome  of  the  d  test  following  an  CLS  regression 
and  indicate,  if  anything,  that  the  WAL  procedure  tended  to  create 
autocorrelation  rather  than  remove  it. 

The  h  test  administered  after  a  WAL  procedui'e  regression  also 
indicates  not  only  that  the  WAL  procedure  did  net  remove  the  ilA(l) 
autocorrelation  but  also  increased  the  chance  of  rejecting  the 
hypothesis  of  no  autocorrelation  when  A.  <  0. 

The  value  of  did  not  appear  to  affect  the  outcome  of  the 
tests  for  autocorrelation  and  other  than  the  effect  mentioned  on  the 
McNown  test,  the  signal-to-ncise  ratio  did  not  seem  to  play  a  part 
ir.  the  outcome  of  the  tests  either. 


AB(l)  Error  Structure 

The  results  of  the  autocorrelation  tests  for  Model  II -Ar.  are 
arrayed  in  Tables  XXIII  and  XXIV ;  Table  XXIII  contains  the  results 
for  the  Z-G  runs  and  Table  XXIV,  the  results  for  the  WAL  runs. 

The  McNown  test  and  the  Turbin  h  test  viere  designed  specifically 
to  detect  AF.  autocorrelation  in  a  lagged  endogenous  model  and 
therefore  they  should  be  more  powerful  than  the  Durbir.-Watscr.  d  test 
which  Turbin  himself  (1970)  said  is  not  applicable  to  the  lagged 
endogenous  case.  Yet,  the  Turbin-Watson  d  test  exhibited  generally 
fewer  acceptances  than  either  the  McNown  or  the  h  test  especially  for 
tr.e  3iT.a_ij.er  sample  sice,  .-or  jO|  >  u.j,  tne  percentages  cr  acceptance 
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and  rejection  for  the  d  test  and  the  h  test  are  very  nearly  the  same 
indicating  that  both  were  equally  powerful. 

The  fact  that  my  results  indicate  that  the  I  urt  in-  V.'  at  son  d  test 
and  Durbin's  h  test  are  about  equal  in  power  conflicts  with  previous 
results  of  Kenkel  (19?4;19?5;1976) ,  Park  (1975; 1576)  and  Spencer 
(1975) •  Kenkel  and  Park  battled  each  other  in  the  literature  over 
the  power  of  the  h  and  d  tests;  Kenkel  found  that  d  -was  better  than  h 
and  Park  countered  that  h  was  better  than  d.  Spencer  sided  with 
Kenkel;  he  found  evidence  of  a  serious  small  sample  bias  in  the  h  test 


in  that  it  tended  to  detect  serial  correlation  when  none  existed.  My 
results,  as  pointed  out  above,  indicate  that  if  there  is  a  bias,  the 
h  test  tended  toward  a  higher  level  of  acceptance  than  the  d  test 
when  T  =  20. 

The  anomaly  mentioned  earlier  with  respect  to  the  McNown  test 
can  be  seen  in  Table  XXIII.  For  values  of  p  <  0.3,  the  McNown  test 
results  paralleled  those  cf  the  d  and  h  tests  but  when  the  value  of 
P  -  .99.  there  was  a  sharp  reversal  of  the  trend  of  acceptances  and 
rejections.  As  you  can  see  by  locking  at  Table  XXIV,  the  McNown 
test  does  not  exhibit  this  behavior  during  the  WAL  runs.  Again  I  an 
at  a  loss  to  explain  this.  However,  with  a  p  value  so  close  to  1.0, 
it  is  entirely  possible  that  unexpected  things  night  happen  during 
the  CLS  regression  of  (12)  in  Chapter  III. 

Neither  the  Z-G  procedure  nor  the  V/AL  procedure  was  very 
successful  at  removing  the  Ar.(l)  autocorrelation  even  though  the 
NAL  procedure  was  specifically  designed  for  it.  The  Durbin-Watson  d 
test  exhibits  rather  high  levels  cf  rejection  after  a  Z-G  run  and 


both  the  Durbin-Watson  d  test  and  the  Durbin  h  test  shew  high  levels 
of  rejection  after  a  UAL  procedure  especially  when  p  is  close  to  one. 

Even  though  the  signal-to-noise  ratio  did  r.ct  seer,  to 
appreciably  affect  the  results,  the  value  of  px  did  have  an 
interesting  effect.  For  |p|  >  0.3,  when  went  from  G.d  to  0.3,  the 
level  of  acceptances  for  the  d  test  rose  and  the  level  of  rejections 
fell. 


Conclusions  and  Recommendations 

As  is  true  of  any  Monte  Carlo  investigation,  the  results  should 
not  be  generalised  to  any  great  extent.  Strictly  speaking,  the 
results  are  only  valid  for  the  exact  input  parameter  values  tried  and 
for  the  exact  error  structures  used.  The  reader  is  warned  than, 
applying  the  results  of  this  investigation  to  values  of  parameters 
or  error  structures  not  actually  tried  should  be  done  cautiously. 

That  said,  I  will  now  summarize  the  results  of  this  Monte  Carlo 
investigation . 

Pesaran  Procedure 

The  Pesaran  procedure  was  developed  to  handle  Model  I-MA  but  in 
general  both  0L3  and  ?-W  were  better  than  PSS  cn  the  estimation  of 
but  worse  cn  she  estimation  of  £ .  The  Durtir. -Watson  d  test  seemed  to 
indicate  that  this  procedure  removed  the  >IA(i)  autocorrelation.  Cn 
AP.fi )  autocorrelation,  this  procedure  displayed  the  same  properties 
except  that  with  high  values  of  p ,  it  did  not  remove  the  An(i) 


autocorrelation . 


Use  of  P-U  is  recommended  over  PES;  the  estimate  of  p  would  be 
used  as  a  proxy  for  A. . 

Beach-MacKinnor.  Procedure 

The  Beach-MacKinnon  procedure  was  developed  to  handle  Model  I-AE 
and  the  data  seemed  to  point  toward  its  superiority  over  both  0L3  and 
?-W .  Under  an  MA(l)  error  structure,  the  results  are  not  as  clear, 
however.  None  of  the  procedures,  OLS,  P-J  or  B-M,  was  able  to 
estimate  the  correct  sign  on  A  so  caution  is  advised  here.  B-M  did 
slightly  better  on  the  estimate  of  j3  ir.  terms  of  its  variance  and  ICE 
but  both  OLS  and  ?-W  were  better  in  terms  of  the  absolute  bias  of  the 
estimate  of  A.  B-M  also  appeared  to  remove  both  MA(l)  and  AR(l) 
autocorrelation. 

The  use  of  B-M  is  recommended  over  both  OLS  and  ?-W  unless 

theoretically  the  sign  on  the  estimate  of  p  is  doubted.  In  that  case, 

analyze  the  autocovariances  of  the  disturbances  of  an  OLS  regression 

7 

to  determine  if  the  error  structure  might  be  MA,  If  the  disturbance 
appears  to  be  MA(l),  use  of  P-W  is  recommended. 

Zellr.er-Geisel  Procedure 

The  Zellner-Geisel  procedure  was  designed  for  Model  II-MA  and 
the  results  indicated  that  it  was  far  superior  to  ?-W  and  better  than 
OLS  for  large  values  of  a..  Furthermore,  it  seemed  to  do  a  passable 
job  at  removing  the  MA(l)  autocorrelation.  On  AR(l)  autocorrelation, 
this  procedure  holds  its  own  against  CLS  especially  for  large  values 
of  p  in  terms  of  its  bias.  The  Z-G  procedure  was  clearly  better  than 

'Zee  Box  and  Jenkins  (197c)  or  Pir.dyck  and  p.ubinfeid  (197-)  for 
information  cn  how  to  analyze  autocovariar.ee  functions. 
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P-W  based  on  the  outcome  of  the  McNown  test.  However,  the  Z-G 
procedure  did  not  appear  to  remove  AR(i)  autocorrelation. 

It  is  recommended  that  the  Z-G  procedure  be  performed  and  a 
Durbin -Watson  d  test  on  the  residuals  be  accomplished.  If  the 
hypothesis  of  no  autocorrelation  is  rejected,  then  look  at  the 
autocovariance  function  of  the  residuals.  If  an  MA(l)  error  structure 
seems  appropriate  and  the  value  of  ,Y  >  0.6,  use  the  Z-G  procedure 
estimates  with  some  confidence.  If,  however,  an  AR(l)  error 
structure  seems  more  appropriate,  or  an  MA(l)  error  structure  with  a 
small  value  for  Y.  use  of  OLS  on  the  data  is  recommended. 

Wallis  Three -3 ten  Procedure 

The  Wallis  procedure  was  designed  for  Model  II -A?..  The  results 
were  not  favorable.  OLS  was  better  than  WAL  in  terms  of  bias, 
variance,  MSS  and  Euclidian  distance.  This  was  true  even  when  the 
error  structure  was  KA(l).  Additionally,  based  on  the  results  of  the 
Durbin -W atsor.  d  test,  the  WAL  procedure  did  not  do  an  acceptable  job 
removing  either  MA(l)  or  AR(l)  autocorrelation. 

In  certain  circumstances  WAL  performed  better  than  ?-W  based  on 
the  outcome  of  the  McNown  test  but  use  of  CL3  is  recommended  when  an 
AR(l)  error  structure  has  been  specified.  Try,  also,  either  the 
Iteration  on  a  Serial  Correlation  Parameter  procedure  discussed  by 
Sargent  (1966)  cr  Klein's  Nonlinear  Maximum  Likelihood  procedure 
(1958).  Sargent  found  these  two  methods  were  the  best  in  his  study. 

Durbin-Watsor.  d  Test 

The  Durbin-Watson  d  test  proved  to  be  quite  powerful  at 
detecting  the  presence  of  not  only  an  AR(l)  error  structure  cut  also 
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an  MA(l)  error  structure.  It  was  more  powerful  than  the  Durbin  h  test 
when  the  error  structure  was  MA(l )  and  the  model  was  Model  II -MA. 

And  it  was  at  least  as  good  if  not  better  than  Durbin's  h  for 
Model  II  with  an  AR(l)  error  structure. 

It  is  indeed  fortuitous  that  most  canned  regression  packages 
calculate  and  output  the  Durbin-Watson  d  statistic  ar.d  its  continued 
use  is  recommended. 

McHown  Test 

In  small  samples,  the  McNown  test  appeared  to  be  roughly  as 
powerful  as  the  Durbin-Watson  d  test  when  the  error  structure  was 
AR(l),  It  was  not  very  reliable  when  the  error  structure  was  MA(l). 
Additionally,  the  P-W  estimates  for  Model  II — calculated  only  if  the 
McNown  test  indicated  rejection  of  the  no  autocorrelation  hypothesis — 
tended  to  be  inferior  to  the  0L3 ,  Z-C-  and  WAL  estimates.  Re-running 
the  Model  II  computer  runs  basing  the  P-W  estimates  on  the  outcome  of 
the  Durbin-Watson  d  test  or  the  Durbin  h  test  is  suggested  as  a 
possible  extension  of  this  investigation. 

Use  of  the  McNown  test  on  small  samples  is  not  recommended;  use 
the  Durbin-Watson  d  test  instead. 

Durbin  h  Test 

The  Durbin  h  test  seemed  to  be  adequate  in  detecting  both  An(i) 
and  MA(l)  autocorrelation.  But  the  Durbin-Watson  d  test,  even  though 
theoretically  it  should  not  be  applied  when  lagged  endogenous 
variables  are  used  as  regressors,  performed  at  least  as  well  if  not 
better  than  the  h  test  on  A?.(i)  data  and  outperformed  the  h  test  or. 

MA ( 1 )  data. 


Because  of  the  questionable  nature  of  Durbin's  h  test  in 
recent  studies  and  because  the  Durbin-Watson  d  test  could  detect  both 
AR(l)  and  MA ( 1 )  autocorrelation  in  small  samples,  use  of  the  d  test 
over  the  h  test  is  recommended . 
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